Title: An in vitro method for construction of a DNA library

FIELD OF THE INVENTION

The present invention relates to optimizing DNA sequences in order
to (a) improve the properties of a protein of interest by
artificial generation of genetic diversity of a gene encoding the
protein of interest by the use of the so-called gene- or DNA
shuffling technique to create a large library of "genes",
expressing said library of genes in a suitable expression system,
and screening the expressed proteins in respect of specific
characteristics to determine such proteins exhibiting desired
properties or (b) improve the properties of regulatory elements
such as promoters or terminators by generation of a library of
these elements, transforming suitable hosts therewith in operable
conjunction with a structural gene, expressing said structural
gene and screening for desirable properties in the regulatory
element.



BACKGROUND OF THE INVENTION

It is generally found that a protein performing a certain
bioactivity exhibits a certain variation between genera, and
differences may exist even between members of the same species.
This variation is of course even more pronounced at the genomic
level.

This natural genetic diversity among genes coding for proteins
having basically the same bioactivity has been generated in Nature
over billions of years and reflects a natural optimization of the
proteins coded for in respect of the environment of the organism
in question.
In today's society the conditions of life are vastly removed from
the natural environment and it has been found that the naturally
occurring bioactive molecules are not optimized for the various
uses to which they are put by mankind, especially when they are
used for industrial purposes.

It has therefore been of interest to industry to identify such
bioactive proteins that exhibit optimal properties in respect of
the use for which they are intended.

This has for many years been done by screening of natural sources,
or by use of mutagenesis. For instance, within the technical field
of enzymes for use in e.g. detergents, the washing and/or
dishwashing performance of e.g. naturally occurring proteases,
lipases, amylases and cellulases has been improved significantly
by in vitro modifications of the enzymes.

In most cases these improvements have been obtained by
site-directed mutagenesis resulting in substitution, deletion or
insertion of specific amino acid residues which have been chosen
either on the basis of their type or on the basis of their
location in the secondary or tertiary structure of the mature
enzyme (see for instance US patent no. 4,518,584).

In this manner the preparation of novel polypeptide variants and
mutants, such as novel modified enzymes with altered
characteristics, e.g. specific activity, substrate specificity,
thermal, pH and salt stability, pH-optimum, pI, Km, Vmax etc.,
has successfully been performed to obtain polypeptides with
improved properties.

For instance, within the technical field of enzymes the washing
and/or dishwashing performance of e.g. proteases, lipases,
amylases and cellulases has been improved significantly.
An alternative general approach for modifying proteins and enzymes
has been based on random mutagenesis, for instance, as disclosed
in US 4,894,331 and WO 93/01285.

As it is a cumbersome and time consuming process to obtain
polypeptide variants or mutants with improved functional
properties a few alternative methods for rapid preparation of
modified polypeptides have been suggested.

Weber et al., (1983), Nucleic Acids Research, vol. 11, 5661,
describes a method for modifying genes by in vivo recombination
between two homologous genes. A linear DNA sequence comprising a
plasmid vector flanked by a DNA sequence encoding alpha-1 human
interferon in the 5'-end and a DNA sequence encoding alpha-2 human
interferon in the 3'-end is constructed and transfected into a
rec A positive strain of E. coli. Recombinants were identified and
isolated using a resistance marker.

Pompon et al., (1989), Gene 83, p. 15-24, describes a method for
shuffling gene domains of mammalian cytochrome P-450 by in vivo
recombination of partially homologous sequences in Saccharomyces
cerevisiae by transforming Saccharomyces cerevisiae with a
linearized plasmid with filled-in ends, and a DNA fragment being
partially homologous to the ends of said plasmid.

In WO 97/07205 a method is described whereby polypeptide variants
are prepared by shuffling different nucleotide sequences of
homologous DNA sequences by in vivo recombination using plasmidic
DNA as template.

US patent no. 5,093,257 (Assignee: Genencor Int. Inc.) discloses a
method for producing hybrid polypeptides by in vivo recombination.
Hybrid DNA sequences are produced by forming a circular vector
comprising a replication sequence, a first DNA sequence encoding
the amino-terminal portion of the hybrid polypeptide, a second DNA
sequence encoding the carboxy-terminal portion of said hybrid
polypeptide. The circular vector is transformed into a rec
positive microorganism in which the circular vector is amplified.
This results in recombination of said circular vector mediated by
the naturally occurring recombination mechanism of the rec
positive microorganism, which include prokaryotes such as Bacillus
and E. coli, and eukaryotes such as Saccharomyces cerevisiae.

One method for the shuffling of homologous DNA sequences has been
described by Stemmer (Stemmer, (1994), Proc. Natl. Acad. Sci. USA,
Vol. 91, 10747-10751; Stemmer, (1994), Nature, vol. 370, 389-391).
The method concerns shuffling homologous DNA sequences by using in
vitro PCR techniques. Positive recombinant genes containing
shuffled DNA sequences are selected from a DNA library based on
the improved function of the expressed proteins.

The above method is also described in WO 95/22625. WO 95/22625
relates to a method for shuffling of homologous DNA sequences. An
important step in the method described in WO 95/22625 is to cleave
the homologous template double-stranded polynucleotide into random
fragments of a desired size followed by homologously reassembling
of the fragments into full-length genes.

A disadvantage inherent to the method of WO 95/22625 is, however,
that the diversity generated through that method is limited due to
the use of homologous gene sequences (as defined in WO 95/22625).

Another disadvantage in the method of WO 95/22625 lies in the
production of the random fragments by the cleavage of the template
double-stranded polynucleotide.

A further reference of interest is WO 95/17413 describing a method
of gene or DNA shuffling by recombination of specific DNA
sequences - so-called design elements (DE) - either by
recombination of synthesized double-stranded fragments or
recombination of [...] so-called functional elements [...]
different sequences to be recombined.

WO 95/17413 therefore also entails the disadvantage that [...]
relatively limited. Furthermore the [...] automatisation.

Despite the existence of the above methods there is still a need
for better iterative in vitro recombination methods for preparing
novel polypeptide variants. Such methods should also be capable of
being performed in small volumes, and amenable to automatisation.

The present invention concerns briefly the utilization of a
template shift of a newly synthesized DNA strand during in vitro
DNA synthesis in order to achieve DNA shuffling. By using this
technique it is possible to obtain such results in a more
expedient manner, and to some extent even a greater variation than
in the above mentioned methods.

The method of the invention is also very well suited for [...].

[...] technique is used in combination with an error-prone
polymerase thereby introducing an even greater variation in the
library created.
More specifically the present invention relates to a method for
the construction of a library of recombined homologous
polynucleotides from a number of different starting single or
double stranded parental DNA templates and primers by induced
template shifts during an in vitro polynucleotide synthesis using
a polymerase, whereby

A. extended primers or polynucleotides are synthesized by

a) denaturing parental double stranded DNA templates to
produce single stranded templates,

b) annealing said primers to the single stranded DNA
templates,

c) extending said primers by initiating synthesis by use of
said polymerase,

d) causing arrest of the synthesis, and

e) denaturing the double strand to separate the extended
primers from the templates,

B. a template shift is induced by

a) isolating the newly synthesized single stranded extended
primers from the templates and repeating steps A.b) to
A.e) using said extended primers produced in (A) as both
primers and templates, or

b) repeating steps A.b) to A.e),

C. the above process is terminated after an appropriate number
of cycles of process steps A. and B.a), A. and B.b), or
combinations thereof, and

D. optionally the produced polynucleotides are amplified in a
standard PCR reaction with specific primers to selectively
amplify homologous polynucleotides of interest.

In specific embodiments various modifications can be made in the
process of the invention. For example it is advantageous to apply
a defective polymerase, either an error-prone polymerase to
introduce mutations in comparison to the templates, or a
polymerase that will discontinue the polynucleotide synthesis
prematurely to effect the arrest of the reaction.

Further modifications will be described below. 
In a further aspect the invention relates to a method of
identifying polypeptides exhibiting improved properties in
comparison to naturally occurring polypeptides of the same
bioactivity, whereby a library of recombined homologous
polynucleotides produced by the above process is cloned into an
appropriate vector, said vector is then transformed into a
suitable host system, to be expressed into the corresponding
polypeptides and displayed, said polypeptides are then screened in
a suitable assay, and positive results selected.

In a still further aspect the invention relates to a method for
producing a polypeptide of interest as identified in the preceding
process, whereby a vector comprising a polynucleotide encoding
said polypeptide is transformed into a suitable host, said host is
grown to express said polypeptide, and the polypeptide recovered
and purified.

Finally, in further final aspects the invention relates to a
recombined/shuffled protein, which is obtainable by any of the
methods according to the invention, and which is a
recombined/shuffled protein comprising the sequences disclosed
herein (vide infra).

In those final aspects of the invention, the term "obtainable"
denotes that said protein is preferably obtained by a method
according to the invention. However a prior art known
recombination/shuffling technique such as those described in
WO 95/22625 or WO 95/17413 may be used too, either alone or in
combination with a method according to the invention, in order to
obtain said recombined protein.

Accordingly, further final aspects of the invention are:

a recombined/shuffled protease obtainable by any of the methods
according to the invention, and comprising a recombined sequence,
which contains at least two different partial sequences from at
least two different wild-type proteases;

a recombined/shuffled lipase obtainable by any of the methods
according to the invention, and comprising a recombined sequence,
which contains at least two different partial wild-type sequences;

a recombined/shuffled Pseudomonas lipase obtainable by any of the
methods according to the invention, and comprising a recombined
sequence, which contains at least two different partial sequences
from at least two of the different wild-type Pseudomonas lipases;

a recombined/shuffled xylanase obtainable by any of the methods
according to the invention, and comprising a recombined sequence,
which contains at least two different partial sequences from at
least two different wild-type xylanases;

a recombined/shuffled cellulase obtainable by any of the methods
according to the invention, and comprising a recombined sequence,
which contains at least two different partial sequences from at
least two different wild-type cellulases;

a recombined/shuffled amylase obtainable by any of the methods
according to the invention, and comprising a recombined sequence,
which contains at least two different partial sequences from at
least two different wild-type amylases;

a recombined/shuffled laccase obtainable by any of the methods
according to the invention, and comprising a recombined sequence,
which contains at least two different partial sequences from at
least two different wild-type laccases;

a recombined/shuffled phytase obtainable by any of the methods
according to the invention, and comprising a recombined sequence,
which contains at least two different partial sequences from at
least two different wild-type phytases.
DEFINITIONS

Prior to discussing this invention in further detail, the 
following terms will first be defined. 

"Shuffling": The term "shuffling" means recombination of
nucleotide sequence fragment(s) between two or more homologous
polynucleotides resulting in output polynucleotides (i.e.
polynucleotides having been subjected to a shuffling cycle) having
a number of nucleotide fragments exchanged, in comparison to the
input polynucleotides (i.e. starting point homologous
polynucleotides).

"Homology of DNA sequences or polynucleotides": In the present
context the degree of DNA sequence homology is determined as the
degree of identity between two sequences indicating a derivation
of the first sequence from the second. The homology may suitably
be determined by means of computer programs known in the art such
as GAP provided in the GCG program package (Program Manual for the
Wisconsin Package, Version 8, August 1994, Genetics Computer
Group, 575 Science Drive, Madison, Wisconsin, USA 53711)
(Needleman, S.B. and Wunsch, C.D., (1970), Journal of Molecular
Biology, 48, 443-453).

"Homologous": The term "homologous" means that one
single-stranded nucleic acid sequence may hybridize to a
complementary single-stranded nucleic acid sequence. The degree of
hybridization may depend on a number of factors including the
amount of identity between the sequences and the hybridization
conditions such as temperature and salt concentration as discussed
later (vide infra).
Using the computer program GAP (vide supra) with the following
settings for DNA sequence comparison: GAP creation penalty of 5.0
and GAP extension penalty of 0.3, it is in the present context
believed that two DNA sequences will be able to hybridize (using
low stringency hybridization conditions as defined below) if they
mutually exhibit a degree of identity preferably of at least 70%,
more preferably at least 80%, and even more preferably at least
85%.

"Heterologous": If two or more DNA sequences mutually exhibit a
degree of identity which is less than that specified above, they
are in the present context said to be "heterologous".
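
The identity thresholds in the definitions above can be illustrated
with a minimal sketch. It assumes the two sequences have already
been aligned (for example by a program such as GAP) and are given
as equal-length strings with '-' marking gaps. The function names,
and the choice to count identity over all aligned columns, are
assumptions made for illustration; the sketch does not reproduce
GAP's own scoring or its gap penalties.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns holding the same base in both sequences.

    Assumption: inputs are pre-aligned, equal-length strings using '-' for gaps.
    Columns where both sequences have a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if not (x == "-" and y == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for x, y in columns if x == y and x != "-")
    return 100.0 * matches / len(columns)


def classify(identity: float, threshold: float = 70.0) -> str:
    """Label a pair 'homologous' at or above the threshold, else 'heterologous'.

    The default of 70% is the lowest of the preferred levels named in the text;
    80.0 or 85.0 can be passed for the stricter levels.
    """
    return "homologous" if identity >= threshold else "heterologous"
```

For example, two aligned 10-mers differing at one position give an
identity of 90%, which this sketch labels "homologous" at all three
preferred thresholds.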

"Hybridization": Suitable experimental conditions for determining
if two or more DNA sequences of interest do hybridize or not are
herein defined as hybridization at low stringency as described in
detail below.

Molecules to which the oligonucleotide probe hybridizes under
these conditions are detected using an x-ray film or a
phosphoimager.

"Primer": The term "primer" used herein especially in connection
with a PCR reaction is an oligonucleotide (especially a
"PCR-primer") defined and constructed according to general
standard specifications known in the art ("PCR A practical
approach" IRL Press, (1991)).

"A primer directed to a sequence": The term "a primer directed to
a sequence" means that the primer (preferably to be used in a PCR
reaction) is constructed to exhibit at least 80% degree of
sequence identity to the sequence fragment of interest, more
preferably at least 90% degree of sequence identity to the
sequence fragment of interest, which said primer consequently is
"directed to". The primer is designed to specifically anneal at
the sequence fragment or region it is directed towards at a given
temperature. Especially identity at the 3' end of the primer is
[...].

"Flanking": The term "flanking" used herein in connection with DNA
sequences comprised in a PCR-fragment means the outermost partial
sequences of the PCR-fragment, both in the 5' and 3' ends of the
PCR fragment.

"Polypeptide": Polymers of amino acids sometimes referred to as
proteins. The sequence of amino acids determines the folded
conformation that the polypeptide assumes and this in turn
determines biological properties and activity. Some polypeptides
consist of a single polypeptide chain (monomeric), whereas others
comprise several associated polypeptides (multimeric). All enzymes
and antibodies are polypeptides.

"Enzyme": A protein capable of catalysing chemical reactions.
Specific types of enzymes to be mentioned are such as amylases,
proteases, carbohydrases, lipases, cellulases, oxidoreductases,
esterases, etc. Of specific interest in relation to the present
invention are enzymes used in detergents, such as proteases,
lipases, cellulases, amylases, etc.
DETAILED DESCRIPTION OF THE INVENTION

The present invention relates in its first aspect to a method for
the construction of a library of recombined homologous
polynucleotides from a number of different starting single or
double stranded parental DNA templates and primers by induced
template shifts during an in vitro polynucleotide synthesis using
a polymerase, whereby

A. extended primers or polynucleotides are synthesized by

a) denaturing a parental double stranded DNA template to
produce single stranded templates,
b) annealing said primers to the single stranded DNA
templates,

c) extending said primers by initiating synthesis by use of
said polymerase,

d) causing arrest of the synthesis, and

e) denaturing the double strand to separate the extended
primers from the templates,

B. a template shift is induced by

a) isolating the newly synthesized single stranded extended
primers from the templates and repeating steps A.b) to
A.e) using said extended primers produced in (A) as both
primers and templates, or

b) repeating steps A.b) to A.e),

C. the above process is terminated after an appropriate number
of cycles of process steps A. and B.a), A. and B.b), or
combinations thereof, and

D. optionally the produced polynucleotides are amplified in a
standard PCR reaction with specific primers to selectively
amplify homologous polynucleotides of interest.
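
The cycle of steps A to C can be pictured with a toy simulation. It
is only a sketch under strong assumptions that are not in the text:
the parental templates are modelled as pre-aligned strings of equal
length, each extension copies between 1 and max_step bases before
arrest, and the template for each cycle is picked uniformly at
random. Hybridization, strand polarity and mismatches are ignored,
and all names (shuffle_by_template_shift, max_step) are
hypothetical.

```python
import random


def shuffle_by_template_shift(parents, n_products=10, max_step=8, rng=None):
    """Toy model of repeated extension, arrest and template shift.

    Each product grows from its 5' end. In every cycle it "anneals" to a
    randomly chosen parent (the template shift of step B) and copies a short
    stretch from it before synthesis is arrested (steps A.c and A.d).
    Returns a list of (product_sequence, cycles_used) tuples.
    """
    rng = rng or random.Random()
    length = len(parents[0])
    if any(len(p) != length for p in parents):
        raise ValueError("toy model assumes pre-aligned parents of equal length")
    library = []
    for _ in range(n_products):
        product = ""
        cycles = 0
        while len(product) < length:
            template = rng.choice(parents)      # template shift
            step = rng.randint(1, max_step)     # arrested extension length
            start = len(product)
            product += template[start:start + step]
            cycles += 1
        library.append((product, cycles))
    return library
```

With two parents that differ at many positions, most products in
this model are mosaics of both, and a smaller max_step (shorter
extension per cycle) gives more crossovers per product at the cost
of more cycles.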

According to the invention the polymerase may be a DNA or an RNA
polymerase; specific polymerases to be mentioned are such as DNA
polymerases like T4 polymerase, T7 polymerase, E. coli DNA
polymerase I or the Klenow fragment of DNA polymerase I, or
thermostable polymerases such as Taq, Amplitaq, Vent, Pwo.

One of the advantages of the invention is that it makes it
possible to control the length of the extension of the primers in
the reaction in a convenient manner.

This can be accomplished by various means such as choice of
polymerase, the physical and chemical conditions during the action
of the polymerase, e.g. pH, temperature, buffer/salt
concentration, and addition of various chemicals.

It is known that various polymerases carry out the DNA synthesis
at different rates (nucleotides incorporated per second). For
example, the Klenow fragment of DNA polymerase I has a limited
extension rate compared to e.g. the Taq polymerase (Sambrook et
al. 1989).

Polymerases also display differences in processivity, which is the
average number of nucleotides incorporated before the polymerase
dissociates from the template/extended primer; again the Klenow
polymerase is an example of a polymerase with limited
processivity.

The choice of polymerase is therefore an important means in
controlling the average extension of the primers.
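
How processivity bounds the average extension can be sketched with
a simple stochastic model. The assumption, which is not stated in
the text, is that the polymerase dissociates after each
incorporated nucleotide with a fixed probability of 1/processivity,
so that extension lengths follow a geometric distribution whose
mean equals the processivity. The processivity values used below
are hypothetical and are not measured properties of any named
enzyme.

```python
import random


def simulate_extension(processivity, max_len, rng):
    """Nucleotides added in one binding event before the polymerase falls off.

    Assumption: after each incorporated nucleotide the polymerase dissociates
    with probability 1/processivity; extension also stops at max_len.
    """
    p_off = 1.0 / processivity
    added = 0
    while added < max_len:
        added += 1
        if rng.random() < p_off:
            break
    return added


def mean_extension(processivity, max_len=10_000, trials=20_000, seed=0):
    """Average extension length over many simulated binding events."""
    rng = random.Random(seed)
    total = sum(simulate_extension(processivity, max_len, rng) for _ in range(trials))
    return total / trials
```

In this model a polymerase with a processivity of 10 extends
primers by about 10 nucleotides per binding event on average, and
one with a processivity of 50 by about 50, which is the sense in
which the choice of polymerase controls the average extension.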

These conditions may also exert an influence on the fidelity of
the polymerase (the rate by which point mutations are introduced;
HIV reverse transcriptase is an example of a polymerase of low
fidelity), a parameter useful in combining shuffling and
mutagenesis.

In specific embodiments various modifications can be made in the
process of the invention. For example it is advantageous to apply
a defective polymerase, either an error-prone polymerase to
introduce mutations in comparison to the templates, or a
polymerase that will discontinue the polynucleotide synthesis
prematurely to effect the arrest of the reaction. One such
defective polymerase that could be mentioned is a Klenow
polymerase having low processivity.

In another embodiment of the invention polymerase will be added
after each cycle, if the polymerase used is not thermostable.
According to the invention the starting single or double stranded
parental templates may be different in that they contain different
point mutations in the same native polynucleotide (gene), or they
can be homologous polynucleotides (genes) isolated from nature,
which may be amplified by PCR, or they can be combinations
thereof. The templates used in the process of the invention are
hereby homologous, showing an identity at the DNA level of e.g.
more than 95%, 90%, 85%, 80%, 75%, 70%, 65%, 60%, 55%, or even
more than 50%.

It may be advantageous to use pre-selected templates comprising
mutations with improved properties of interest. The present
recombination method of the invention will then recombine said
improved mutations for subsequent screening for even further
improvements in the properties of interest.

Said pre-selected templates with improved properties of interest
may have been identified by standard procedures in the art
comprising e.g. i) error-prone PCR of templates of interest
followed by ii) screening/selection for templates with improved
characteristics of interest. The mutagenesis frequency (low or
high mutagenesis frequency) of the error-prone PCR step is
preferably adjusted in relation to the subsequent screening
capacity, i.e. if the screening capacity is limited the
error-prone PCR frequency is preferably low (i.e. one to two
mutations in each template) (see WO 92/186'3 5 for further
details).

The arrest of the polymerase reaction in step A.d) may as
indicated above be obtained in different ways, such as by raising
the temperature, or adding specific reagents as described in
WO 95/17413.

When raising the temperature for this purpose, it is preferred to
use temperatures between 50°C and 99°C.
When using chemical agents DMSO is a possibility. Appropriate
procedures are mentioned in e.g. WO 95/17413.

The process of the invention uses annealing of the primers to the
templates in step A.b). In this context the annealing may be
random or specific, meaning either anywhere on the polynucleotide
or at a specific position depending on the nature of the primer.

Also, the primers to be used may be completely random primers
(NNNNNNNNNNN) (N meaning that a mixture of the four bases (A, T,
G, C) is used at a particular position in the primer during
synthesis), semi-random primers, or specific primers.
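
The distinction between completely random, semi-random and
specific primers can be sketched as a pattern expansion, where each
N stands for a position synthesized with a mixture of the four
bases. The helper name make_primer and the example patterns are
hypothetical; real degenerate primers are pools of molecules,
whereas this sketch draws a single member of such a pool.

```python
import random

BASES = "ACGT"


def make_primer(pattern, rng):
    """Draw one primer sequence from a pattern.

    Every 'N' is replaced by a randomly chosen base; all other characters are
    kept as fixed positions. An all-N pattern gives a completely random primer,
    a mixed pattern a semi-random primer, and a pattern without N a specific one.
    """
    return "".join(rng.choice(BASES) if c == "N" else c for c in pattern.upper())


rng = random.Random(1)
fully_random = make_primer("N" * 11, rng)     # completely random primer
semi_random = make_primer("ATGNNNGCC", rng)   # fixed ends, random core
specific = make_primer("ATGGCCGCC", rng)      # no random positions
```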

If the extended primers produced are to be separated from the
primers during the process it is convenient to use labeled
templates in order to provide a simple means for separation; a
preferred marker is biotin or digoxigenin.

According to the invention the number of cycles necessary will be
less than 500, in most cases less than 200, and normally less than
100 cycles.

In an embodiment of the invention the above in vitro shuffling is
combined with a subsequent in vivo shuffling by methods such as
those described in WO 97/07205.

In its second aspect the invention relates to a method of
identifying polypeptides exhibiting improved properties in
comparison to naturally occurring polypeptides of the same
bioactivity, whereby a library of recombined homologous
polynucleotides produced by the above process is cloned into an
appropriate vector, said vector is then transformed into a
suitable host system, to be expressed into the corresponding
polypeptides, said polypeptides are then screened in a suitable
assay, and positive polypeptides selected.
In an embodiment it is contemplated that the polypeptides of
interest encoded by the shuffled library are expressed as a
suitable fusion protein (e.g. as a hybrid with gIII of
bacteriophage M13/fd) in order to display said recombined
polypeptide on the surface of phage or bacteria.

In a third aspect the invention relates to a method for producing
a polypeptide of interest as identified in the preceding process,
whereby a vector comprising a polynucleotide encoding said
polypeptide is transformed into a suitable host, said host is
grown to express said polypeptide, and the polypeptide recovered
and purified.

The use of partially random (semi-random) or completely random
primers (mixtures of bases in a selected number or all positions
in the primer) as initiation point for the DNA synthesis provides
certain novel possibilities for the combined use of shuffling and
random mutagenesis.

It is often difficult to obtain an in vitro recombination of
polynucleotides that display relatively limited homology. By the
use of an embodiment of the invention even very diverse
polynucleotides can be forced to recombine.

According to that embodiment at least two templates (or two pools
of diverse templates) are applied. The novel synthesis of the one
polynucleotide can then be based on only one strand (i.e. either
the sense or the anti-sense strand), and the synthesis of the
other polynucleotide is based on the opposite strand.

This can be accomplished by isolating the complementary strands
from the two templates, e.g. by having these strands labeled by
biotin. Synthesis of DNA is initiated by annealing either
specific, partly random or completely random primers to these
templates and adding a suitable polymerase. This can be performed
as either separate reactions for the different templates or in
just one reaction. Synthesis should preferably be performed under
conditions that favor production of relatively short new
fragments. These fragments can subsequently be isolated from the
templates based on the affinity label. A PCR reaction is carried
out on these fragments and as the starting material originates
from two different strands, the newly synthesized fragments must
recombine in order to produce full length PCR products - a kind of
forced recombination.

Also for this embodiment rapid PCR with short or no extension time
can be applied advantageously in order to enhance recombination,
especially if pools of templates are used for the two strands.

The length of the primer and the annealing temperature utilized in
the process determine if random primers will anneal and the number
of mismatches between the template and the primer that can be
accommodated. By varying the primer length and the annealing
temperature the method of the invention provides a means for
achieving random mutagenesis within a certain nucleotide window
representing the length of the primer. The method of the invention
thereby provides substantial benefits compared to other random
mutagenesis approaches, especially the high probability for
several base substitutions close to each other in the sequence,
e.g. the use of a completely random 20'mer (mixture of all four
nucleotides in all 20 positions) will according to theory under
given experimental conditions give a certain reasonably high
probability for having several base substitutions close to each
other.

Error prone PCR (=high mutagenesis frequency PCR) does not provide
this possibility. Error prone PCR provides a very low probability
for having more than one base substitution within one codon
(coding for one amino acid in a translated polypeptide).
Obviously the substitution of only one base within a codon does
not provide total random mutagenesis (at protein level) as only a
limited set of amino acid substitutions can be obtained by one
base substitution at DNA level (e.g. Methionine encoded by the
ATG-codon requires three base substitutions to become the TGT or
TGC-codon encoding Cysteine).
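
The methionine example can be checked with a short sketch based on
the standard genetic code. The table is built from the conventional
64-letter amino acid string in TCAG codon order; the function names
are hypothetical. The sketch counts how many bases differ between
two codons and lists which amino acids can be reached from a codon
by a single base substitution.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code; codons enumerated in TCAG order, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def base_changes(codon_a, codon_b):
    """Number of positions at which two codons differ."""
    return sum(1 for x, y in zip(codon_a, codon_b) if x != y)


def single_substitution_amino_acids(codon):
    """Amino acids (or '*' for stop) reachable by changing exactly one base.

    The amino acid encoded by the starting codon itself is excluded.
    """
    reachable = set()
    for pos in range(3):
        for base in BASES:
            if base != codon[pos]:
                mutant = codon[:pos] + base + codon[pos + 1:]
                reachable.add(CODON_TABLE[mutant])
    reachable.discard(CODON_TABLE[codon])
    return reachable
```

For ATG, single substitutions reach only leucine, valine,
threonine, lysine, arginine and isoleucine; cysteine (TGT or TGC)
differs from ATG at all three positions, in agreement with the
example in the text.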

In one embodiment of the invention the process is therefore
performed by using random or semi-random primers having a length
of from 6 to 200 bp, preferably from 10 to 70 bp, and better from
[...] to 40 bp.

One of the advantages of the method of the invention is its
robustness. In some embodiments the constant presence of full
length template provides a further advantage, avoiding PCR
contamination problems. Furthermore it is much less laborious,
with less hands-on work, than other described methods, thereby
providing excellent possibilities for automation.

PCR-primers:

The PCR primers are constructed according to the standard
descriptions in the art. Normally they are 10-75 base-pairs (bp)
long. However, for the specific embodiment using random or
semi-random primers the length may be substantially longer as
indicated above.

PCR-reactions:

If not otherwise mentioned the PCR-reactions performed according
to the invention are performed according to standard protocols
known in the art.

The term "Isolation of PCR fragment" is intended to cover as broad
as simply an aliquot containing the PCR fragment. However
preferably the PCR fragment is isolated to an extent which removes
surplus of primers, nucleotides, templates etc.

In an embodiment of the invention the DNA fragment(s)
is (are) prepared under conditions resulting in a low, medium or
high random mutagenesis frequency.

To obtain a low mutagenesis frequency the DNA sequence(s)
(comprising the DNA fragment(s)) may be prepared by a standard
PCR amplification method (US 4,683,202 or Saiki et al., (1988),
Science 239, 487-491).

A medium or high mutagenesis frequency may be obtained by
performing the PCR amplification under conditions which increase
the misincorporation of nucleotides, for instance as described by
Deshler, (1992), GATA 9(4), 103-106; Leung et al., (1989),
Technique, Vol. 1, No. 1, 11-15.
It is also contemplated according to the invention to combine
the PCR amplification (i.e. according to this embodiment
also DNA fragment mutation) with a mutagenesis step using a
suitable physical or chemical mutagenizing agent, e.g., one which
induces transitions, transversions, inversions, scrambling,
deletions, and/or insertions.

Expressing the recombinant protein from recombinant shuffled
sequences

Expression of the recombinant protein encoded by the shuffled
sequence in accordance with the second and third aspect of
the present invention may be performed by use of standard
expression vectors and corresponding expression systems known in
the art.

In the context of the present invention the term "positive
polypeptide variants" means resulting polypeptide variants
possessing functional properties which have been improved in
comparison to the polypeptides producible from the corresponding
input DNA sequences. Examples of such improved properties can be as
different as e.g. enhanced or lowered biological activity,
increased wash performance, thermostability, oxidation stability,
substrate specificity, antibiotic resistance etc.

Consequently, the screening method to be used for identifying
positive variants depends on which property of the polypeptide
in question it is desired to change, and in what direction the
change is desired.

A number of suitable screening or selection systems to
screen or select for a desired biological activity are described
in the art. Examples are:

Strausberg et al. (Biotechnology 13: 669-673 (1995)) describes
a screening system for subtilisin variants having
calcium-independent stability;

Bryan et al. (Proteins 1:326-334 (1986)) describes a
screening assay for proteases having an enhanced thermal
stability; and

PCT-DK96/00322 describes a screening assay for lipases having
improved wash performance in washing detergents.

An embodiment of the invention comprises screening or
selection of recombinant protein(s), wherein the desired biological
activity is performance in dish-wash or laundry detergents.
Examples of suitable dish-wash or laundry detergents are disclosed in
PCT-DK96/00322 and WO 95/30011.

If, for instance, the polypeptide in question is an enzyme
and the desired improved functional property is the wash
performance, the screening may conveniently be performed by use of a
filter assay based on the following principle:

The recombination host cell is incubated on a suitable medium
and under suitable conditions for the enzyme to be secreted,
the medium being provided with a double filter comprising a first
protein-binding filter and on top of that a second filter
exhibiting a low protein binding capability. The recombination host
cell is located on the second filter. Subsequent to the
incubation, the first filter comprising the enzyme secreted from the
recombination host cell is separated from the second filter
comprising said cells. The first filter is subjected to screening
for the desired enzymatic activity and the corresponding
microbial colonies present on the second filter are identified.

The filter used for binding the enzymatic activity may be
any protein binding filter, e.g. nylon or nitrocellulose. The top
filter carrying the colonies of the expression organism may be
any filter that has no or low affinity for binding proteins, e.g.
cellulose acetate or Durapore®. The filter may be pre-treated
with any of the conditions to be used for screening or may be
treated during the detection of enzymatic activity.

The enzymatic activity may be detected by a dye, fluorescence,
precipitation, pH indicator, IR-absorbance or any other
known technique for detection of enzymatic activity.

The detecting compound may be immobilized by any immobilizing
agent, e.g. agarose, agar, gelatin, polyacrylamide, starch,
filter paper, cloth; or any combination of immobilizing agents.

If the improved functional property of the polypeptide is
not sufficiently good after one cycle of shuffling, the
polypeptide may be subjected to another cycle.

In an embodiment of the invention wherein homologous
polynucleotides representing a number of mutations of the same
gene are used as templates, at least one shuffling cycle is a
backcrossing cycle with the initially used DNA fragment, which may be
the wild-type DNA fragment. This eliminates non-essential
mutations. Non-essential mutations may also be eliminated by using
wild-type DNA fragments as the initially used input DNA material.

Also contemplated to be within the invention are polypeptides
having biological activity such as insulin, ACTH, glucagon,
somatostatin, somatotropin, thymosin, parathyroid hormone,
pituitary hormones, somatomedin, erythropoietin, luteinizing hormone,
chorionic gonadotropin, hypothalamic releasing factors,
antidiuretic hormones, thyroid stimulating hormone, relaxin, interferon,
thrombopoietin (TPO) and prolactin.

A requirement for the starting parental DNA sequences,
encoding the polypeptide(s) to be shuffled, is that they are at
least 50%, 60%, 70%, 80%, 90%, or 95% homologous. DNA sequences
being less homologous will have less inclination to interact and
recombine.
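
As a rough illustration of the homology thresholds listed above, the following sketch computes percent identity for two sequences that are assumed to be already aligned and of equal length. The text does not specify how homology is to be calculated, so the function names and the example sequences are assumptions made for illustration only.

```python
def percent_identity(seq_a, seq_b):
    """Percent of identical positions in two pre-aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


def meets_threshold(seq_a, seq_b, threshold=50.0):
    """True if the two sequences reach the chosen identity threshold (in percent)."""
    return percent_identity(seq_a, seq_b) >= threshold


# Hypothetical 20-base parents differing at two positions.
parent_1 = "ATGGCTAGCTTAGGCATCGA"
parent_2 = "ATGGCTAGATTAGGCTTCGA"
print(percent_identity(parent_1, parent_2))
print(meets_threshold(parent_1, parent_2, threshold=95.0))
```

Real parental genes would first need a proper alignment (gaps included); this sketch deliberately leaves that step out.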

It is also contemplated according to the invention to shuffle
parental polynucleotides that are homologous as indicated
above originating from wild type organisms of different genera.

Further, the starting parental templates to be shuffled may
preferably have a length of from about 50 bp to 20 kb, preferably
about 100 bp to 10 kb, more preferred about 200 bp to 7 kb,
especially about 400 bp to 2 kb.

The starting parental DNA sequences may be any DNA sequences
including wild-type DNA sequences, DNA sequences encoding
variants or mutants, or modifications thereof, such as extended
or elongated DNA sequences, and may also be the outcome of DNA
sequences having been subjected to one or more cycles of
shuffling (i.e. output DNA sequences) according to the method of the
invention or any other method (e.g. any of the methods described
in the prior art section).

When using the method of the invention the resulting
recombined homologous polynucleotides (i.e. shuffled DNA sequences)
have had a number of nucleotide fragments exchanged. This results
in replacement of at least one amino acid within the polypeptide
variant, if comparing it with the parent polypeptide. It is to be
understood that also silent exchanges are contemplated (i.e.
nucleotide exchange which does not result in changes in the amino
acid sequence).

MATERIALS AND METHODS 

Specific Method Used in the Examples:

A) DNA encoding different enzyme variants of the same gene, or
different enzymes having the same type of activity, encoded by
homologous genes, are mixed. The DNA is provided as either PCR
fragments, plasmid, phage or genomic DNA.

B) The resulting pool of DNA is mixed with DNA polymerase,
dNTP, a suitable buffer and primers (being either random
oligomers (length of 6-30 nucleotides) or specific oligomers
(length of 6-50 nucleotides) or a combination of both types).

C) The PCR mixture is put into a PCR thermocycler (either
cold or hot) in a suitable tube.

c1) The thermocycler is heated to a temperature of
90-100°C for a period of time (typically 1-10 min) in order to
denature the DNA templates.

c2) Thereafter the following procedure (cycle) is followed
(repeated): The template is denatured (typically 90-100°C
for 0-5 minutes). Then the temperature is lowered (typically to
a value between 10°C and 90°C for 0-5 minutes) to allow annealing
of the primer to the single stranded template. Now the temperature
is raised again to denaturation temperature (90-100°C),
allowing small extensions of the primer to be synthesized by the
DNA polymerase during ramping. Alternatively a short extension
period (typically 0-30 seconds at 70-75°C) can be introduced to
allow larger extensions of the primers to be generated. When the
temperature reaches a value where denaturation takes place, the
extended primers and templates are again separated. This
procedure can be repeated (typically between 1 and 99 cycles).

D) Having performed the desired number of cycles the generated
small DNA polymers can be purified from the oligomers used
as primers. One way is to isolate and clone a specific amplified
band containing the gene coding for the polypeptide of interest
into a suitable vector. This can be done either on agarose gel
(typically isolating fragments between 50 and 1000 base pairs),
by beads (using an affinity label on either templates or
primers) or through columns.

E) Then the purified (or the not purified) DNA polymers can
be assembled in a standard PCR reaction (for instance 94°C,
[...] minutes; (94°C, 30 sec; 55°C, 30 sec; 72°C, 2 min) x 25;
72°C, [...] minutes; 4°C).
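
The cycling described in step c2) can be pictured with a deliberately simplified toy model. The sketch below assumes equal-length, pre-aligned parental templates and a single growing strand that is extended by a short random stretch on a randomly chosen template in each cycle; it ignores annealing kinetics, mismatches and polymerase errors. The function name simulate_shuffling and all parameter values are hypothetical.

```python
import random


def simulate_shuffling(templates, cycles=99, max_extension=20, seed=0):
    """Toy model of one growing strand: in each cycle it anneals to a randomly
    chosen parental template and is extended by a short stretch before the
    next denaturation separates it from that template again."""
    rng = random.Random(seed)
    length = len(templates[0])
    if any(len(t) != length for t in templates):
        raise ValueError("this sketch assumes equal-length, pre-aligned templates")
    strand, origin = [], []
    for _ in range(cycles):
        pos = len(strand)
        if pos >= length:                      # full-length product reached
            break
        idx = rng.randrange(len(templates))    # template shift
        step = rng.randint(1, max_extension)   # short extension during ramping
        for base in templates[idx][pos:pos + step]:
            strand.append(base)
            origin.append(idx)
    return "".join(strand), origin


# Hypothetical 60-base "parents" that differ everywhere, so each base reveals its origin.
parents = ["A" * 60, "C" * 60, "G" * 60]
chimera, origin = simulate_shuffling(parents)
print(chimera)
print(len(set(origin)), "templates contributed")
```

Shorter extensions per cycle give more template shifts per product, which is why the protocol keeps the extension period at or near zero seconds.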

Specific primers or DNA polymers generated by specific
primers can be added in order to generate a specific DNA polymer
containing the gene of interest. As mentioned in point D,
this DNA polymer can be purified and cloned into a vector of
interest.

A strong advantage of the method exemplified here is the
robustness and lack of PCR contamination problems, due to the
constant presence of parental template. Furthermore this method
is less labor demanding than methods described in the prior art,
thereby providing excellent possibilities for automation.

Nine different plasmids containing DNA sequences encoding
9 different variants of the H. lanuginosa lipase gene were
mixed in equimolar amounts. The variant genes contained from two
to seven mutations scattered throughout the gene.

The DNA sequence of the H. lanuginosa lipase gene and the
amino acid sequence of the lipase are disclosed in EP 0 305 216.

The variants are indicated according to standard terminology
as described in e.g. EP 0 396 608 and WO 97/07202.

The following 9 variant genes were shuffled:

1'. N94K+D96L+E99K
2'. SPPRRP+N94K+D96L+T231R+N233R+D234R+Q249R
3'. SPPRRP+A19C+C36A+N94K+D96L+Q249R
4'. STPRRP+N94R
5'. SCIRR+N94K+D96L+E239C+Q249R
6'. D137G+D167G+E210V
7'. D96L+E99K+V187A
8'. SPPRRP+D57G+N94K+D96L+Q249R
9'. N94R+F95L

The following components were mixed in a microtube:

2 µl plasmid mixture (0.15 µg/µl), specific primers flanking
the gene (1 pmol/µl), 0.25 mM dNTP, 2.5 mM MgCl2, 2 µl
10x Taq buffer (Perkin Elmer), 0.5 µl Taq enzyme in a total
volume of 20 µl.

The tube was set in a Perkin Elmer 2400 thermocycler. The
following PCR-program was run: (94°C, 5 minutes) 1 cycle;
(94°C, 30 seconds; 70°C, 0 seconds) 99 cycles; (72°C, 2 minutes;
4°C indefinite) 1 cycle.
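
To get a feel for the length of this program, the sketch below adds up the programmed hold times only. Ramp times between temperatures are instrument-dependent and are not included, and the open-ended 4°C hold is left out; the data layout and the name total_hold_seconds are illustrative assumptions.

```python
# Each entry: (number of repeats, [(temperature in deg C, hold time in seconds), ...]).
# A hold of 0 s means the block only touches that temperature before ramping on.
PROGRAM = [
    (1, [(94, 300)]),            # initial denaturation, 5 minutes
    (99, [(94, 30), (70, 0)]),   # shuffling cycles
    (1, [(72, 120)]),            # final extension, 2 minutes
]


def total_hold_seconds(program):
    """Sum of all programmed hold times, ignoring ramping between steps."""
    return sum(repeats * sum(hold for _, hold in steps) for repeats, steps in program)


seconds = total_hold_seconds(PROGRAM)
print(seconds, "s =", seconds / 60, "min")
```

The programmed holds add up to 3390 seconds (56.5 minutes); the actual run is longer because the primer extension in this method takes place during the ramps.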

The PCR-reaction was run on a 1.5% agarose gel. A DNA
band of the specific expected size was cut out of the agarose
gel and purified using JETsorb (from GENOMED Inc.). The purified
PCR-product was cloned into a TA-vector (from Invitrogen, the
original TA cloning kit). The ligated product was transformed
into a standard Escherichia coli strain (DH5α).

20 transformants were fully sequenced across the gene of
interest.

Result:

The following 20 variants were found:

1. D137G+D167G+E210V+Y213C

2. SPPRRP+D57G+N94K+D96L+Q249R

3. N94R+F95L

4. SPPRRP+D137G+D167G+E210V

5. N94K+D96L+E99K+V187A+T267I

6. D137G+D167G+E210V

7. N94K+D96L+E99K+V187A

8. D57G+N94R+F95L+Q249R

9. N94K+D96L+E99K+E210V

10. SPPRRP+A19C+C36A+N94K+D96L

11. N94R+F95L

12. D137G+D167G+E210V

13. N94K+D96L+Q249R

14. SPPRRP+Q15P+A19C+C36A+N94K+D96L

15. SPPRRP+N94K+D96L+T231R+N233R+D234R+Q249R 

16. D137G+D167G+E210V 

17 . SCIRR+N94K+D96L+Q249R 

18. N94K+D96L+E99K 

19. N94R+F95L 

20. SPPRRP+N94R+F95L+F113S+Q249R 

Nearly all mutations were represented (19 of 20), indicating
little bias for specific templates.



Statistics:

Not shuffled                             10
Shuffled between at least 2 templates     8
Shuffled between at least 3 templates     2
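
One possible way to estimate how many parental templates a sequenced clone must have drawn on is sketched below. The text does not state how the counts in the table were obtained, so this is an assumption: for every site mutated in any parent, the clone's state (mutant or wild type) must agree with at least one chosen parent, and the smallest such set of parents is reported. Mutations found in no parent are treated as new point mutations and ignored. The result is a lower bound and need not reproduce the table exactly. The names site_states and min_templates are hypothetical.

```python
import re
from itertools import combinations

# The nine parental variants of this example.
PARENTS = [
    "N94K+D96L+E99K",
    "SPPRRP+N94K+D96L+T231R+N233R+D234R+Q249R",
    "SPPRRP+A19C+C36A+N94K+D96L+Q249R",
    "STPRRP+N94R",
    "SCIRR+N94K+D96L+E239C+Q249R",
    "D137G+D167G+E210V",
    "D96L+E99K+V187A",
    "SPPRRP+D57G+N94K+D96L+Q249R",
    "N94R+F95L",
]


def site_states(variant):
    """Map each mutated site to its mutation, e.g. {'94': 'N94K', 'N-term': 'SPPRRP'}."""
    states = {}
    for token in variant.split("+"):
        match = re.fullmatch(r"[A-Z](\d+)[A-Z]", token)
        states[match.group(1) if match else "N-term"] = token
    return states


def min_templates(output, parents=PARENTS):
    """Smallest number of parents that together explain every parental site of `output`."""
    parent_states = [site_states(p) for p in parents]
    out = site_states(output)
    constraints = []
    for site in set().union(*parent_states):
        target = out.get(site, "wt")
        agreeing = {i for i, ps in enumerate(parent_states) if ps.get(site, "wt") == target}
        if agreeing:  # states no parent can explain count as new point mutations
            constraints.append(agreeing)
    for k in range(1, len(parents) + 1):
        for combo in combinations(range(len(parents)), k):
            if all(agreeing & set(combo) for agreeing in constraints):
                return k
    return None


print(min_templates("N94R+F95L"))
print(min_templates("N94K+D96L+E99K+V187A"))
```

A result of 1 corresponds to "not shuffled"; larger values indicate recombination between at least that many templates under the stated assumptions.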



The shuffled sequences can then be subcloned from the E. coli TA
vector into the yeast vector pJS026 as a BamHI-XbaI fragment
(see WO 97/07205), and e.g. screened for new shuffled sequences
with improved performance in detergents (see WO 97/07205).

Method 2:

PCR products of 10 different lipase variant genes were
generated as above and pooled in equimolar amounts.

The following 10 mutant genes were shuffled.

1'. D137G+D167G+E210V

2'. D96L+E99K+V187A

3'. N94K+D96L+E99K

4'. SPPRRP+D57G+N94K+D96L+Q249R

5'. D111N+F211A+G225P

6'. SPPRRP+N94K+D96L+T231R+N233R+D234R+Q249R

7'. SPPRRP+A19C+C36A+N94K+D96L+Q249R

8'. STPRRP+N94R

9'. N94R+F95L

10'. SCIRR+N94K+D96L+E239C+Q249R

The following mixture was generated in a suitable tube:
1 µl PCR mixture (0.1 µg), decamer random primer (3[illegible]
pmol), 2 µl 10x Klenow buffer (Promega), 0.25 mM dNTP, 2.5 mM
MgCl2, in a total volume of 20 µl.

The mixture was set in a PE2400 thermocycler where the
following program was run: 96°C, 5 minutes; 25°C, 5 minutes; 0.5
µl Klenow enzyme was added; 25°C, 60 minutes; 35°C, 90 minutes.

This procedure generated a high number of small DNA polymers
originating from all parts of the gene.

10 µl was taken out for test on agarose gel.
10 µl PCR mixture (0.25 mM dNTP, 1 µl 10x Taq buffer
(Perkin Elmer), 2.5 mM MgCl2, 0.5 µl Taq enzyme) was added to
the 10 µl in the tube in the thermocycler. Then the following
standard PCR-program was run: (94°C, 5 minutes) 1 cycle; (94°C, 30
seconds; 45°C, 30 seconds; 72°C, 30 seconds) 25 cycles; 72°C, 1
minute; 4°C indefinite.

The PCR products were run on a 1.5% agarose gel. A clear
unbiased smear was seen. DNA between 400 and 800 bp was isolated
from the gel.

Half of the purified PCR product was mixed in a tube with
two specific primers (40 pmol) flanking the gene of interest,
0.25 mM dNTP, 2 µl 10x Taq buffer, 2.5 mM MgCl2. Then the
following standard PCR-program was run: (94°C, 5 minutes) 1 cycle;
(94°C, 30 seconds; 50°C, 30 seconds; 72°C, 30 seconds) 25 cycles;
72°C, 7 minutes; 4°C indefinite.

The PCR product was run on a 1.5% agarose gel. A specific
though weak band of the expected size was isolated. Additional
PCR was run using specific primers (as mentioned above) in order
to amplify the PCR-product before cloning.

The PCR-product and the desired vector were cut with the
appropriate restriction enzymes (BamHI/XhoI). The vector and the
PCR product were run on a 1.5% agarose gel, and purified from
the gel.

The cut PCR-product and the cut vector were mixed in a
ligase buffer with T4 DNA ligase (Promega). After overnight
ligation at 16°C the mixture was transformed into E. coli strain
DH5α.

19 clones were fully sequenced across the gene.

Result:

The following 19 variants were found:

1. STPRRP+N94R+N233R+D234R+Q249R

2. SPPRRP+N233R+D234R+Q249R

3. D96L+D167G+Q249R

4. N94R

5. D167G

6. SPPRRP+A19C+A28T+N94K+D96L+D111N+E239C

7. SPPRRP+A19C+C36A+N94K+D96L+Q249R

8. N94K+D96L

9. M-25T+D57G+N94R+E99K+D167G+T231R+N233R+D234R+Q249R

10. N94R+Q249R

11. D167G

12. D167G

13. T32I+N94R+F95L+D167G+Q249R

14. E87K+N94K+D96L

15. N94R+F95L+Q249R

16. N94K+D96L+D111N

17. STPRRP+S17T

18. N94K+D96L+V187A

19. SPPRRP+D57G+N94K+D96L+D111N+L151S



All template variants were represented, indicating little
bias for specific templates.

There were no apparent hot spots with regard to mutation
exchange, which seems to be evenly distributed along the gene.



Statistics:

Not shuffled                              1
Shuffled between at least 2 templates    10
Shuffled between at least 3 templates     6
Shuffled between at least 4 templates     1
Shuffled between at least 5 templates     0
Shuffled between at least 6 templates     1



The shuffled sequences can then be subcloned from the
E. coli TA vector into the yeast vector pJS026 as a BamHI-XbaI
fragment (see WO 97/07205), and e.g. screened for new shuffled
sequences with improved performance in detergents (see WO
97/07205).

Example 3:

Amylase variant shuffling:

In Example 1, it was shown how a number of multiple variants
of H. lanuginosa lipase were shuffled. In a similar manner,
variants of Bacillus α-amylases can be shuffled.

Earlier patent applications have identified variants of
various α-amylases from Bacillus species improved for particular
properties, e.g. thermostability, stability under
calcium-depleted conditions, improved wash-performance etc. (see
WO95/10603, WO96/23874, WO96/23873, and PCT/DK97/00197).

Variants of B. licheniformis α-amylase amyL can be shuffled
as follows. The variants are all located in the B. subtilis
expression vector pDN1528 described in WO95/10603.

The experiment is carried out under the exact same conditions
as Example 1 except that the flanking 27mer primers used
to initiate DNA synthesis are different.

The PCR amplified band of approximately 1500 bp is purified
from an agarose gel and cloned as described in Example 1.
Alternatively restriction sites located within the amyL gene can
be utilized to clone the library of shuffled genes into either
Bacillus plasmid pDN1528 or an E. coli vector containing the
wild type amyL gene, e.g. pJeEN1 described in WO96/23874.


Example 4:

Shuffling of two genes encoding homologous α-amylases:
amyL and the amylase identified by SEQ.ID no. 2 (amino acid) and
SEQ.ID no. 5 (DNA) described in WO96/23873.

The forward strand (identical to the mRNA) of amyL can be
amplified in a PCR using standard conditions.

The forward strand is separated from the reverse strand
based on its affinity to streptavidin coated magnetic beads and
denaturation of the two strands with NaOH.

Similarly the reverse strand (complementary to mRNA) of the
amylase encoded by SEQ.ID no. 5 (WO96/23873) can be amplified and
isolated.

The two isolated strands are used as templates in a PCR: (94°C, 5
minutes) + 99 cycles of (94°C, 30 seconds; 60°C, 0 seconds) +
(72°C, 5 minutes) using random primers of various lengths, Taq
polymerase and standard buffer conditions as described in
Example 1.

The resulting approximately 1500 bp product is cloned either
by TA cloning as described in Example 1 (for verification
of the sequence of the resulting clone) or into the Bacillus vector
pTV3110 utilizing SfiI and PstI restriction sites.

The original template can be removed from the PCR at any
step (e.g. after 5, 10 or 20 cycles) based on the biotin tag.

CLAIMS

1. A method for the construction of a library of recombined
homologous polynucleotides from a number of different starting
single or double stranded parental DNA templates and primers by
induced template shifts during an in vitro polynucleotide
synthesis using a polymerase, whereby

A. extended primers or polynucleotides are synthesized by

a) denaturing a parental double stranded DNA template to
produce single stranded templates,

b) annealing said primers to the single stranded DNA
templates,

c) extending said primers by initiating synthesis by use
of said polymerase,

d) causing arrest of the synthesis, and

e) denaturing the double strand to separate the extended
primers from the templates,

B. a template shift is induced by

a) isolating the newly synthesized single stranded extended
primers from the templates and repeating steps
A.b) to A.e) using said extended primers produced in
(A) as both primers and templates, or

b) repeating steps A.b) to A.e),

C. the above process is terminated after an appropriate number
of cycles of process steps A. and B.a), A. and B.b),
or combinations thereof, and

D. optionally the produced polynucleotides are amplified in a
standard PCR reaction with specific primers to selectively
amplify homologous polynucleotides of interest.

2. The method of claim 1, wherein said polymerase is a
thermostable DNA polymerase, such as Taq, Amplitaq, Vent, Pwo.

3. The method of claim 1, wherein said polymerase is a DNA
polymerase, such as T4 polymerase, T7 polymerase, E. coli DNA
polymerase I or the Klenow fragment of DNA polymerase I, and
said polymerase is added to the reaction mixture after each
cycle.

4. The method of claim 1, wherein said polymerase is an RNA
polymerase.

5. The method of any of the claims 1 to 4, wherein said
polymerase is an error-prone polymerase, such as a reverse
transcriptase like HIV reverse transcriptase.

6. The method of any of the claims 1 to 5, wherein said
starting single or double stranded parental templates are
different in that they contain different point mutations in the
same native polynucleotide (gene), or are homologous
polynucleotides (genes) isolated from nature.

7. The method of claim 5, wherein said starting single or
double stranded parental templates exhibit an identity at the
DNA level of e.g. more than 95%, 90%, 85%, 80%, 75%, 70%, 65%,
60%, 55%, or even more than 50% identity.

25 

8. The method of any of the claims 1 to 7, wherein the
arrest of the polymerase reaction in step A.d) is achieved by
raising the temperature.

9. The method of claim 8, wherein the temperature is raised
to a value between 90°C and 99°C, preferably 94°C.

10. The method of any of the claims 1 to 7, wherein the
arrest of the polymerase reaction in step A.d) is achieved by
adding specific reagents, such as DMSO.

11. The method of any of the claims 1 to 10, wherein said
primers comprise a population of completely random primers.

12. The method of any of the claims 1 to 10, wherein said
primers comprise a population of semi-random primers.

13. The method of any of the claims 1 to 10, wherein said
primers comprise a population of specific primers.

14. The method of any of the claims 1 to 13, wherein said
primers are labeled, preferably with biotin or digoxigenin.

15. The method of any of the claims 11, 12 or 14, wherein
said random or semi-random primers have a length of from 6 to
500 bp, preferably from 15 to 300 bp, and better from 25 to 150
bp.

16. The method of any of the claims 13 or 14, wherein said
primers are 6 to 75 bp long.

17. The method of any of the claims 1 to 16, wherein said
starting parental DNA templates are at least 50%, 60%, 70%,
80%, 90%, or 95% homologous.

18. The method of claim 17, wherein said starting parental
DNA templates originate from wild type organisms of different
genera.

19. The method of any of the claims 17 or 18, wherein said
starting parental templates have a length of from about 50 bp
to 20 kb, preferably about 100 bp to 10 kb, more preferred
about 200 bp to 7 kb, especially about 400 bp to 2 kb.

20. The method of any of the claims 1 to 19, wherein said
starting parental templates are linear DNA fragments generated
by a PCR reaction.

21. The method of any of the claims 1 to 20, wherein said
starting parental templates are cloned into suitable vectors,
such as a plasmid.

22. The method of any of the claims 1 to 21, wherein said
starting parental templates represent a polynucleotide encoding
an enzyme, such as a carbonyl hydrolase, carbohydrase, or
esterase, such as a protease, lipase, amylase, cellulase,
oxidase, oxidoreductase, etc.

23. The method of any of the claims 1 to 21, wherein said
starting parental templates represent a polynucleotide encoding
a polypeptide having biological activity, such as insulin,
ACTH, glucagon, somatostatin, somatotropin, thymosin,
parathyroid hormone, pituitary hormones, somatomedin, erythropoietin,
luteinizing hormone, chorionic gonadotropin, hypothalamic
releasing factors, antidiuretic hormones, thyroid stimulating
hormone, relaxin, interferon, thrombopoietin (TPO) and
prolactin.

24. The method of any of the claims 1 to 21, wherein said
starting parental templates represent a polynucleotide
containing a biological function such as transcription initiation
(promoter) or termination, translational initiation, operator
sites related to expression of genes.

25. The method of any of the claims 1 to 24, wherein said
number of cycles is less than 500, 200, or less than 100
cycles.

26. A method of identifying a polypeptide of interest
exhibiting improved properties in comparison to naturally occurring
or other known polypeptides of the same activity, whereby a
library of recombined homologous polynucleotides produced by a
process according to any of the claims 1 to 25 is cloned into
an appropriate vector, said vector is transformed into a
suitable host system, to be expressed into the corresponding
polypeptides and displayed, and said polypeptides are screened in
a suitable assay, and positive polypeptides selected.

27. A method for producing a polypeptide of interest as
identified according to claim 26, whereby a vector comprising a
polynucleotide encoding said identified polypeptide is
transformed into a suitable host, said host is grown to express said
polypeptide, and the polypeptide recovered and purified.